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Abstract.  Superquantiles,  which  refer  to  conditional  value-at-risk  (CVaR)  in  the  same  way  that 
quantiles  refer  to  value-at-risk  (VaR),  have  many  advantages  in  the  modeling  of  risk  in  finance  and  en¬ 
gineering.  However,  some  applications  may  benefit  from  a  further  step,  from  superquantiles  to  second- 
order  superquantiles.  Measures  of  risk  based  on  second-order  superquantiles  have  recently  been  explored 
in  some  settings,  but  key  parts  of  the  theory  have  been  lacking:  descriptions  of  the  associated  risk  en¬ 
velopes  and  risk  identifiers.  Those  missing  ingredients  are  supplied  in  this  paper,  and  moreover  not  just 
for  second-order  superquantiles,  but  also  for  a  much  broader  class  of  mixed  superquantile  measures  of 
risk.  Such  dualizing  expressions  facilitate  the  development  of  dual  methods  for  mixed  and  second-order 
superquantile  risk  minimization  as  well  as  superquantile  regression,  a  proposed  second-order  version  of 
quantile  regression. 
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1  Introduction 

The  second-order  version  of  conditional  value-at-risk  that  we  introduced  in  [20],  with  further  explana¬ 
tions  in  [19,  22],  corresponds  to  a  sort  of  smoothing  of  the  cumulative  distribution  function  of  a  random 
variable  but  has  other  key  interpretations  as  well.  Motivated  by  applications  in  risk-averse  optimization 
and  regression  in  engineering,  we  develop  it  further  here  with  particular  attention  to  duality.  The  term 

1This  material  is  based  upon  work  supported  in  part  by  the  U.  S.  Air  Force  Office  of  Scientific  Research  under  grants 
FA9550-1 1-1-0206  and  F1ATA01194G001  and  DARPA  under  grant  HR0011517798. 
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“superquantile”  as  an  alternative  to  “conditional  value-at-risk”  [18]  is  employed  for  this  broad  purpose, 
beyond  the  usual  domain  of  finance. 

To  understand  the  second-order  ideas  with  which  we  will  be  occupied  in  this  paper,  some  back¬ 
ground  in  the  first-order  ideas  is  needed,  and  we  begin  briefly  with  that.  The  conditional  value-at-risk 
CVaRQ(V)  of  a  random  variable  X  oriented  to  “loss”  or  “cost,”  at  a  probability  level  a  €  [0,1),  is 
the  expected  value  of  the  a-upper  tail  distribution  of  X  as  defined  in  [23,  24],  When  the  cumulative 
distribution  function  Fx  for  X  is  continuous  at  VaRQ(V),  the  value-at-risk  of  X  at  level  a,  this  tail 
distribution  is  simply  the  conditional  distribution  for  X  with  respect  to  the  interval  [VaRa,oo),  but 
otherwise  it  requires  taking  into  account  an  atom  of  probability  at  VaRa(V).  This  distinction  makes 
CVaR  different  from  other  notions  introduced  around  the  same  time,  such  as  “tail- VaR”  [2],  which 
includes  the  entire  probability  atom,  and  “mean  shortfall”  [11],  which  omits  it  (although  the  similar 
term  “expected  shortfall”  has  been  ambiguous  in  this  respect).  Conditional  value-at-risk  can  also  be 
expressed  by  the  formula 

1  f1 

CVaRa(V)  =  - -  /  VaR p(X)dp 

l  a 

of  [1],  which  was  adopted  by  Follmer  and  Schied  as  the  definition  of  “average”  value-at-risk  [6]. 2 

These  other  concepts  were  originally  articulated  for  random  variables  oriented  toward  gain,  but  the 
loss  orientation  we  follow  here  has  the  advantage  of  making  the  value-at-risk  VaRQ  (V)  coincide  with 
the  a- quantile  qx(a)  familiar  in  statistics: 

VaRQ(V)  =  qx{cy)  =  min{.T  €  JR\Fx(x)  >  a}. 

This  uniting  of  VaR  with  quantiles  has  further  suggested  a  way  of  exiting  from  finance-driven  termi¬ 
nology  about  risk  for  the  sake  of  applications  outside  of  finance,  namely  by  speaking  of  the  conditional 
value-at-risk  CVaRa(X)  as  the  a- superquantile  of  X  in  the  parallel  notation  qx(a).  Then  the  integral 
formula  for  CVaR  becomes 

Qx (a)  =  [  qx(/3)d/3.  (1) 

1  ~a  J  a 

With  this  shift  we  have  a  platform  for  displaying  the  second-order  superquantiles  of  [20],  to  be 
denoted  by  qx(a);  they  are  defined  by 

<lx  (a)  =  z  ~  [  (Ix(P)dfl-  (2) 

1-aJa 

There  is  more  to  the  second-order  superquantile  than  just  the  analogy  between  (1)  and  (2),  though,  as 
has  been  laid  out  in  [20]. 

Especially  of  interest  is  a  formula  derived  in  [20]  that  extends  to  superquantiles  and  second-order 
superquantiles  the  basic  connection  between  VaR  and  CVaR  discovered  in  [23,  24] .  That  earlier  formula 
asserts,  in  quantile/superquantile  notation,  that 

qx(oi)  =  min{c  +  Va{X  —  c)},  qx(oi)  =  argminjc  +  Va(X  —  c)},  (3) 

ce-R  ceR 

2They  preferred  “average”  because  “conditional”  could  have  differing  usages.  This  issue  also  adds  motivation  to  our 
passage  to  “superquantiles.” 
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in  terms  of  the  “regret”  functional 

Va(X)  =  — 1 - — £[max{0,X}]  =  — 1 - —  [  max{0,  qx(/3)}d/3 
i  —  a  1  —  a  Jq 

(with  the  argmin  being  an  interval  which,  if  not  a  singleton,  has  the  quantile  in  question  as  its  left 
endpoint).  The  second-order  extension  asserts  that 


qx(a)  =  min{c  +  Va(X  -  c)},  qx(c v)  =  argmin{c  +  Va(X  -  c)},  (4) 

CSi?  cS-R 

where 

If1 

Va(X)  =  - -  /  max{0  ,qx(P)}dp.  (5) 

Achieving  such  a  formula  had  been  one  of  our  main  goals  in  pursuing  second-order  superquantilies, 
because  it  is  deeply  tied  to  generalized  regression.  The  joint  formula  (3)  is  central  to  quantile  regression, 
a  well  known  alternative  to  ordinary  least-squares  regression,  so  the  joint  formula  (4)  indicates  a  possible 
elevation  to  superquantile  regression.  The  double  formula  (4)  was  developed  in  [20]  through  a  technique 
in  which  the  superquantiles  of  X  could  be  interpreted  as  the  quantiles  of  a  “super”  random  variable  X 
associated  with  X. 

Our  primary  aim  in  this  paper  is  to  fill  in  missing  parts  of  the  second-order  theory  concerned 
with  duality.  An  important  ingredient  of  duality  for  any  coherent  measure  of  risk,  including  superquan- 
tile/CVaR,  is  an  expression  of  the  risk  as  a  worst-case  expectation  over  an  associated  class  of  probability 
measures.  This  requires  identifying  the  “risk  envelope”  that  characterizes  that  class.  The  risk  envelope 
for  the  risk  measure  given  by  second-order  superquantiles  has  not  yet  been  understood,  but  we  will  pin 
it  down  here. 

This  pushes  us  naturally  into  wider  terrain  in  observing  that  the  integral  formula  for  the  second- 
order  superquantile  casts  it  as  a  special  “spectral”  measure  of  risk  of  X  in  the  sense  of  Acerbi  [1]. 
Spectral  measures  of  risk,  which  have  also  been  studied  from  various  angles  under  the  heading  of  mixed 
superquantile/CVaR  measures  of  risk  [27,  25].  They  are  known  to  be  fundamental  for  characterizing 
coherent  measures  of  risk  that  are  law-invariant  [10,  6,  9,  16,  14,  31]. 3  However,  their  risk  envelopes 
have  not,  until  now,  been  determined.  We  therefore  take  on  first  the  task  of  doing  that  and  are  able 
then  to  pass  to  the  risk  envelope  in  the  second-order  superquantile  case  essentially  as  a  corollary. 

Although  dualization  of  risk  measures  can  be  carried  out  for  a  variety  of  spaces  of  random  variables 
and  paired  dual  spaces  (see  for  example  [30,  4,  8]),  we  focus  here  on  random  variables  with  finite  second 
moments.  A  compelling  reason  in  our  setting  is  that  this  restriction  guarantees  the  finiteness  of  second- 
order  superquantiles.  That  follows  from  their  expression  as  an  integral  of  first-order  superquantiles  and 
the  bounds  derived  for  the  latter  under  such  restriction  in  [22,  Proposition  1],  namely 


vi 


(6) 


where  cr(X)  denotes  standard  deviation  and  the  lower  bound  is  strict  for  nonconstant  X  unless  a  =  0. 
Another  plus  is  that  this  choice  allows  for  random  variables  with  normal  distributions,  whereas  much 
of  the  literature  in  finance  restricts  consideration  to  random  variables  with  essentially  bounded  range. 

3An  insightful  exposition  of  this  topic  has  been  provided  by  Follmer  and  Schied  in  [7,  Section  4.5].  In  that  special 
context,  in  contrast  to  here,  the  treatment  of  mixed  superquantile/CVaR  risk  is  restricted  to  random  variables  on  a 
nonatomic  probability  space. 
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In  Section  2  we  lay  the  foundation  for  working  in  this  framework  and  the  measures  of  risk  that  fit  into 
it.  We  proceed  in  Section  3  with  the  central  results  of  duality  concerning  risk  envelopes  and  the  “risk 
identifiers”  they  associate  with  random  variables.  Section  4  then  applies  the  results  to  optimization 
and  regression.  An  appendix  collects  some  of  the  technical  details  that  are  needed  along  the  way. 

2  Risk  Measure  Framework 

For  a  probability  space  (fi,  F,  P),  we  let 

C2  =  £2{Q,F,F)  :=  {X  :  SI  ->  M  \  X  ^-measurable,  E[X2]  <  oo} 

be  the  space  of  random  variables  with  finite  second  moment,  where  we  write  integration  with  respect  to 
P  using  the  standard  notation  E[X\  =  X(cj)dP(cn).  We  equip  C2  with  the  standard  norm  ||X||2  := 
(E{X2\)1/2 .  As  explained  in  the  introduction,  the  choice  of  C2  ensures,  through  (6),  the  finiteness  of 
the  second-order  superquantiles  qx(ot)  we  are  especially  focused  on. 

In  the  following,  we  deal  with  classes  of  measures  of  risk  defined  on  C2 .  Regularity  [25,  21]  provides 
fundamental  properties  for  such  risk  measures.  We  recall  that  a  measure  of  risk  1Z  :  C?  — >•  (— oo,  oo]  is 
regular  if  it  satisfies  the  following  axioms: 

7Z{X)  =  c  for  constant  random  variables  X  =  c, 

7Z((1  —  t)X  +  tX')  <  (1  —  t)'JZ(X)  +  tTZ(X')  for  all  X ,  X'  G  C2  and  r  G  (0, 1)  (convexity), 

{X  G  C2  |  1Z(X)  <  c}  is  closed  for  all  c  £  M  (closedness), 

7Z(X)  >  E[X]  for  nonconstant  X  G  C2  (averseness), 

which  have  as  a  consequence  that  7 Z(X  +  c)  =  1Z(X)  +  c  for  all  c  G  7R.  In  fact  we  will  only  be  working 
here  with  risk  measures  that  in  addition  are  both  positively  homogeneous , 

7Z(tX)  =  rU(X)  for  r  >  0,  X  G  £2, 


and  monotonic , 

1Z(X)  <  7Z(Y)  whenever  X(tv)  <  Y(uj)  for  a.e.  uj  G  Q. 

In  particular  7Z  is  then  a  coherent  measure  of  risk  in  the  sense  of  [2].  Duality  in  this  case  is  expressed 
by  the  following  correspondence  between  risk  measures  7Z  and  sets  Q  called  their  risk  envelopes .4 

2.1  Theorem  (risk  envelope  duality).  For  a  regular  measure  of  risk  1Z  on  C2  that  is  positively  homo¬ 
geneous  and  monotone,  the  relations 

1Z{X)  =  sup  E[XQ]  for  X  G  £2,  Q  =  {Q  G  C2  \  E[XQ }  <  7Z(X)  for  all  X  G  £2}, 

Q&Q 

give  a  one-to-one  correspondence  between  the  regular  measures  of  risk  7Z  on  C2  that  are  positively 
homogeneous  and  monotonic  and  the  nonempty  closed  convex  subsets  Q  of  C2  that  consist  of  elements 
Q  >  0  with  E[Q\  =  1  and  are  such  that  each  nonzero  Ig£2  has  E[XQ\  >  0  for  at  least  one  Q  G  Q. 

4The  term  “risk  envelope”  was  introduced  in  2002  in  [26]. 
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This  fact,  a  specialization  of  the  general  support  function  correspondence  in  convex  analysis,  is 
a  variant  from  [25]  of  known  results  characterizing  other  classes  of  risk  measures,  starting  with  [2]. 
Important  along  with  the  risk  envelope  Q  associated  with  1Z  are  the  sets 

Qx  =  argma xE[XQ]  for  X  €  £2,  (7) 

QeQ 

which  are  called  the  risk  identifiers  for  the  individual  random  variables  X ,5 

The  measures  of  risk  at  the  center  of  our  attention  are  the  first-order  superquantile  measures  7 Za 
and  the  second- order  superquantile  measures  1Za  given  by 

7 Za(X)  =  qx{ot)  and  7 Za(X)  =  qx{ot)  for  a  G  [0, 1)  (8) 

in  accordance  with  the  expressions  (1)  for  q- \{a)  and  (2)  for  qxipt)  in  Section  1.  The  properties  of 
lZa  that  make  it  regular,  positively  homogeneous,  and  monotonic  have  been  known  for  some  time,  and 
those  properties  are  obviously  inherited  by  TZa  through  the  expression  of  qx(cx)  as  an  integral  in  (2). 
In  both  cases,  therefore,  we  are  dealing  with  measure  of  risk  covered  by  the  preceding  theorem.  For 
7 ZQ,  the  risk  envelope  is  known  to  be 

Qa  ■=  {Q  £  -C2  |  0  <  Q(uj)  <  1/(1  —  a)  a.e.  u  G  Q,  E[Q\  =  1},  (9) 

cf.  [26,  25].  For  7 Za,  the  corresponding  risk  envelope  Qa  will  be  determined  for  the  first  time  in 
Section  3.  However,  to  accomplish  this  efficiently  and  gain  other  new  insights  at  the  same  time,  we  will 
pass  through  a  broader  class  of  risk  measures  as  follows. 

2.2  Definition  (mixed  superquantile  measures  of  risk).  For  a  weighting  measure  A,  namely  a  probabil¬ 
ity  measure  on  ([0, 1),  £>[0ii)),6  the  associated  mixed  superquantile  measure  of  risk1  1Z  :  C?  — >•  (—00,00] 
is  given  by 

7Z(X)  :=  [  qx(fi)  d\(f3).  (10) 

Jo 

For  technical  reasons,  we  exclusively  deal  in  this  situation  with  the  completion  of  ([0, 1  ),Hr01),  A), 
which,  with  a  slight  abuse  of  notation,  we  denote  by  ([0, 1),  A). 

The  key  thing  here  for  our  purposes  is  that  second-order  superquantile  risk  fits  this  definition 
because,  through  (2),  we  have 

Ka(X)  =  qx(ot)  =  [  qx (a)  dXa{/3),  where  A a(S)  :=  m^  n  ^  ^  for  S  €  H[0ii).8  (11) 

Jo  1  -  « 

As  another  special  case,  if  A  is  concentrated  on  a  finite  number  of  points  in  [0, 1),  say  aq,  •••,  Q'fc , 
then  simply  7 Z(X)  =  A(ai)gx(o;i)  +  •••  +  \(ak)qx{ctk)-  A  first-order  superquantile  risk  measure  is 
realized  by  setting  k  =  1. 

sThis  term  was  introduced  in  [27],  although  the  sets  in  question  were  handled  earlier  as  being  the  subdifferentials  of 
convex  analysis  for  the  risk  measure  functionals  in  question. 

6For  a  set  S  with  a  topology,  let  Bs  be  its  Borel  sigma-algebra. 

7 Also  called  a  spectral  measure  of  risk  [1]  and  Choquet  representation  of  distortion  acceptability  functionals  [15]. 
sHcre,  and  throughout  the  paper,  m  denotes  Lcbesgue  measure. 
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Note  in  general  that,  since  A  is  defined  on  £>[o,i);  we  exclude  the  possibility  of  a  weighting  measure 
that  places  a  positive  weight  at  a  =  1.  That  case  simply  yields  'TZ(X)  =  oo  whenever  sup  A  =  oo,  and 
it  is  better  treated  separately. 

The  basic  properties  of  a  mixed  superquantile  risk  measure  are  described  by  the  following  result, 
which  extends  a  previous  results  in  [26,  27]  by  dealing  with  a  significantly  relaxed  condition  for  finiteness 
and  admitting  the  point  (3  =  0  explicitly. 

2.3  Theorem  (mixed  superquantile  properties).  A  mixed  superquantile  risk  measure  IZ  as  in  (10)  is 
well-defined,  monotonic  and  positively  homogeneous.  It  is  regular  if  A({0})  <  1,  but  lacking  averseness 
if  A({0})  =  1.  Specifically, 

7Z(X )  >  E[X]  for  all  X  G  C2  and  IZ(X)  >  E[X ]  for  nonconstant  X  unless  A({0})  =  1. 

It  is  finite  on  C?  whenever  the  weighting  measure  A  satisfies 

l  TITs dXW  <  °° 

and,  regardless  of  the  weighting  measure,  has  IZ(X)  <  oo  whenever  sup  X  <  oo. 

It  has  the  alternative  expression 

IZ(X)  =  f  qx{/3)ip(/3)d/3,  where  p(/3)  :=  f  — 1 —  d\(a),  /3  <E  [0,1]. 

Jo  J o<a</3  1  ~  a 

The  risk  profile  function  ip  is  right-continuous  and  nondecreasing  on  [0, 1]  with  p( 0)  =  0  and  satisfies 
fg(  1  —  a)dp(a)  =  1.  Conversely,  any  p  with  these  properties  arises  from  a  unique  weighting  measure 
A  given  by  d\(a)  =  (1  —  a)dp(a). 

The  proof  of  this  theorem,  similar  in  some  ways  to  that  of  previous  versions  but  containing  new 
parts,  is  provided  in  the  Appendix.  Further  clarification  of  properties  of  mixed  superquantile  measures 
of  risk  has  been  furnished  in  [25,  Mixing  Theorem], 

Next  on  the  agenda  is  applying  this  general  result  to  the  case  in  (11)  that  corresponds  to  second- 
order  superquantile  measures  of  risk. 


2.4  Theorem  (second-order  superquantile  properties).  Any  second-order  superquantile  risk  measure 
1Za  :  C?  — >•  R,  a  €  [0, 1),  is  regular,  monotonic,  and  positively  homogenous,  and  satisfies  for  X  €  C? 

E[X]  <  qx(o)  =  Ha(X)  <  min  j-Epf]  +  ^==!  sup X  j  , 

with  the  lower  bound  holding  with  strict  inequality  whenever  X  is  nonconstant. 

It  has  the  alternative  expressions 


na(x)  = 


i 


1  —  a 


Qx(P)  log 


1  —  a 


d/3=  qx(/3)pa(/3)d/3, 
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with  respect  to  the  risk  profile  function 


if  a  <  f3  <  1 
if  0  <  f3  <  a. 


Va{P)  ■= 


WFlog 

0 


l— a 
1-0 


Moreover,  ipa  is  a  nondecreasing,  finite  convex  function  on  [0, 1]  with  right-derivative  equal  to  1/(1  —  a)2 
as  it  starts  to  grow  from  0  at  (3  =  a. 

Proof.  As  a  special  case  of  Theorem  2.3,  it  follows  automatically  that  TZa  is  well-defined,  regular, 
monotonic,  positively  homogeneous,  and  bounded  from  below  by  E[X].  From  (6), 


na(x)  < 


1  —  a 


E[X]  + 


v/W 


d/3  =  E[X]  + 


o{X)  f1  1 
1  -  a  J a  \/l  -  P 


dX(p)  =  E[X\  + 


2  <r(X) 

v/1  —  a 


Obviously,  TZa(X)  <  sup X  also  holds. 

The  alternative  expression  follows  after  a  specialization  of  ip  of  Theorem  2.3  for  the  given  choice  of 
weighting  measure  A  =  Xa.  Specifically, 


<fi(P)  =  I  —^—d\a(  7)  =  <pa(P) 

J  O<"f<0  1  —  7 


j^—j^—d'-y  if  a  <  f3  <  1 

Jq  1— 7  1—a  1  —  r 

0  if  0  </3<a. 


Since  for  0  <  a  <  b  <  1, 


1 


dp 


log 


1  —  a 
1-6’ 


we  therefore  find  that  the  alternative  expressions  follow. 

The  assertion  about  (pa  being  convex  is  justified  by  its  derivative  being  zero  for  (3  €  (0,  a)  and 
1/((1  —  a)(  1  —  /?))  for  (3  €  (a,  1),  with  left-  and  right-derivatives  at  ft  =  a  equal  to  0  and  1/(1  —  a)2, 
respectively.  □ 

The  upper  bounds  on  TZa  and  TZa  in  (6)  and  Theorem  2.4,  respectively,  are  remarkably  similar,  and 
show  that,  although  second-order  superquantile  risks  are  larger  than  first-order  risks,  the  difference  is 
at  most  o[X)!\J  1  —  a. 


3  Dualization  Through  Risk  Envelopes 

We  now  turn  to  determining  the  dual  expressions  for  mixed  and  second-order  superquantile  risk  mea¬ 
sures  in  terms  of  the  risk  envelopes  described  in  general  in  Theorem  2.1.  The  risk  envelope  Qa  that 
corresponds  to  the  first-order  superquantile  measure  7Za  in  (8)  has  already  been  indicated  in  (9). 

Another  case  where  the  risk  envelope  is  already  known  is  that  of  a  mixed  superquantile  measure 
TZ  associated  with  a  weighting  measure  A  that  is  concentrated  in  finitely  many  points.  Namely,  if 
TZ  =  Ai 7Zai  +  ■  ■  ■  +  XkTZak,  the  corresponding  risk  envelope  is  Q  =  AiQai  +  •  •  •  +  A kQak-  This  follows 
immediately  from  general  principles  of  convex  analysis  and  has  been  recorded  explicitly,  for  instance, 
in  [25], 

No  other  cases  have  so  far  been  worked  out,  and  with  good  reason.  The  risk  envelope  for  a  mixed 
superquantile  TZ  coming  from  a  weighting  measure  A  that  is  not  merely  discrete  ought,  by  analogy,  to 


7 


be  a  sort  of  “continuous  sum”  or  integral  of  various  sets  Qa  of  the  form  in  (9),  and  the  contemplation 
of  such  an  expression  raises  serious  technical  challenges  in  integration  theory. 

We  take  on  those  challenges  here,  but  with  some  of  the  technical  background  details  placed  in  the 
Appendix.  Let 


M  :=  lq  :  [0,1)  ->  C2 


q  is  £>£ 2)  -measurable, 


2  dX(/3)  <  00 


Observe  that  M.  is  well-defined  because  by  Lemma  A. 5  (the  “A”  points  to  the  Appendix),  the  mapping 
/ 3  i->-  ||g(/3)||2  is  S[0)  1 ) -measurable  whenever  q  is  (/3r0ii), #£2) -measurable. 

We  are  now  ready  to  deal  with  the  risk  envelope  of  a  mixed  superquantile  risk  measure  TZ  and  for 
this  purpose  utilize  a  collection  of  random  variables  in  terms  of  (Bochner)  integrals  of  elements  of  M. 
In  the  following,  we  let  1R  =  TR  U  {—00,  00}. 


3.1  Theorem  (risk  envelope  for  mixed  superquantiles).  For  a  mixed  superquantile  measure  of  risk  TZ 
with  associated  weighting  measure  X,  let 9 

Q  =  j  q(/3 )  dX(/3),q  G  M,q(/3)  G  Qp  for  X-a.e.  /3  G  [0, 1)  j  , 

where  cl  denotes  closure  with  respect  to  the  (strong)  topology  on  C? .  Then  Q  is  nonempty,  convex, 
and  is  the  risk  envelope  for  TZ,  i.e.,  for  any  X  G  C2 , 


Q:=c\lQeC 


TZ(X)  =  sup  E[XQ\. 
QeQ 


Moreover,  if  l/\/l  —  a  dX(a)  <  00,  then  Q  is  also  weakly  compact. 
Proof.  Let  X  G  C2  and  /  :  [0, 1)  x  C2  — >•  TR  be  defined  by 


f{a,Q) 


—E[XQ]  if  QeQa 
00  otherwise. 


In  view  of  Definition  A. 3,  /  is  a  normal  integrand  because  (i)  /  is  (£>[o,i)  &  £>£2)-irieasurable  as  the  sum 
of  the  continuous10  function  -E[X-]  on  [0, 1)  x  £?  and  an  indicator  function  vanishing  on  the  set 

{(/3,  Q )  G  [0, 1)  x  C2  |  Q  G  Qp}  G  B[0,i)  ®  2 

and  infinity  elsewhere,  (ii)  f(/3,Q)  >  —E[XQ\  >  —00  for  fi  G  [0,1)  and  Q  G  C2 ,  and  (iii)  for  all 
P  €  [0,1),  f(/3,  •)  is  lower  semicontinuous  by  the  continuity  of  E[X-]  on  C?  and  the  closedness  of 
Qp  C  C2 ,  and  /(/3,  •)  is  not  identical  to  00  with  Q  =  1  G  Qp  furnishing  a  finite  value  f(f3, 1)  =  —E[X\. 
In  view  of  Proposition  A. 6  and  the  fact  that  q  =  1  provides  an  element  of  M.  with  f  f(/3,  q(j3))  dX(/3)  = 


9 We  note  that  Q  resembles  the  Aumann  integral  (see  for  example  [3])  of  the  set-valued  mapping  j3  Qp. 

10Here  continuity  is  with  respect  to  the  product  topology  of  the  norm-topologies  on  [0, 1)  and  C2 . 


—E[X]  <  oo,  Proposition  A. 4  applies.  Consequently,  the  interchange  of  integration  and  minimization 
is  permitted  and  we  obtain  that 

TZ(X)  =  [  sup  E[XQg\  dX(/3 )  =  -  f  inf  f(/3,Q)  dX(/3) 

J  Qp&Qp  J  <?e£2 

=  -  inf  f  f{/3,q(/3 ))  dX(/3). 
q&M  J 

We  next  consider  the  interchange  of  integration  with  respect  to  A  and  P.  For  q  €  M,  it  follows  from 
Lemma  A. 5  that  the  function  (/3,w)  H >  \ X(u>)q(/3)(uj)\  is  measurable.  By  Tonelli-Fubini’s  Theorem  and 
Cauchy-Schwartz  inequality, 

J  \X(u>)qmu>)\d{\  X  F)(/3,uj)  =  J  E[\Xq(P)\]  dX(/3)  <  \\X\\2  j  \\q(P)\\2  dXtf)  <  oo, 

where  the  finiteness  follows  by  the  property  of  q  G  M.  Then  by  Tonelli-Fubini’s  Theorem, 

J  E[Xq(0 )]  dX(0)  =  E  X  J  q(/3)  dXtf)  . 

Since 

j  f(/3,qm  dX(0)  =  J  E[Xq(P)}  dX(j3) 

whenever  q  G  M.  is  such  that  q{j3)  €  Qg  for  A-a.e.  /3  €  [0, 1)  and  f  f(/3,  q(/3 ))  dX(/3)  =  oo  otherwise,  we 
find  that 


inf  [  /(/?,  <?(£))  dX(P)  =  inf  {  [  E[-Xq(P)}  dX(/5)  +  i(q) 

q&Mj  q£M  { J 


where 


t(?)  = 


=  inf  <  -E  X  /  q(/3)  dX(/3 )  +  t(q) 
q£M  I  J 


0  if  q((5)  €  Qg  for  A-a.e.  f3  £  [0, 1) 
oo  otherwise. 


Compiling  the  above  results,  we  see  that 


TZ(X)  =  —  inf  f  f(/3,q(/3 ))  dX(/5)  =  sup  <E  X  f  q({3 )  dX(/3)  —  1  =  supF’fAQ], 

<l£Mj  '  q(zM  {  l  J  J  J  Q(ZQ 

The  convexity  of  Q  follows  from  the  convexity  of  Qg.  Since  1  €  Q,  Q  is  not  empty.  Under  the 
additional  assumption  that  f  1/Vl  —  a  dX(a)  <  oo,  1Z  is  finite- valued  on  £?  and  even  locally  bounded 
around  the  origin  of  £2  by  Theorem  2.3.  This  local  boundedness  for  a  positively  homogeneous  convex 
function,  as  the  support  function  of  a  set  Q,  corresponds  to  that  set  being  bounded.  Consequently, 
Q  is  bounded.  Since  Q  is  convex,  weak  closedness  follows  from  strong  closedness  and  therefore  weak 
compactness  is  established.  □ 

For  the  special  case  of  a  second-order  superquantile  risk  measure  we  then  obtain  the  following 
corollary. 
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3.2  Corollary  (risk  envelope  for  second-order  superquantiles).  For  a  £  [0, 1),  the  risk  envelope  for  7Za 
is  given  by 


Qa  ■=  cN  Q  £  C2  Q  = 


1  —  a 


q(/3)d/3,q  £  M,  q{(3)  £  Qp  for  m-a.e.  /3  £  [a,  1)  >  . 


Moreover,  Qa  is  a  nonempty  weakly-compact  convex  subset  of  C2 . 

In  addition  to  the  trivial  cases  when  A  and/or  P  are  positive  only  on  a  finite  number  of  points  in 
[0, 1)  and  f 2,  respectively,  the  closure  in  the  definition  of  Q  is  unnecessary  under  the  following  condition. 


3.3  Proposition  (dispensing  with  the  closure  operation).  Suppose  that  A  is  nonatomic  and  also  that 
fo  1/(1  —  a)  dX(a)  <  oo.  Then  the  closure  operation  is  superfluous  in  the  expression  of  the  envelope  in 
Theorem  3.1.  One  can  simply  take 

Q  =  | Q  £  C2  Q  =  j  q(p)  d\(/3),  q  £  M,  q(/3)  £  Qp  for  X-a.e.  j3  £  [0, 1) 

Proof.  By  [5],  an  integrably  bounded  ,8[0  ^-measurable  set- valued  mapping  S  :  [0, 1)  =$  C2 ,  with  closed 
and  convex  values,  satisfies 


cl  |  J  S(a)  dX(a)  j  =  J  S(a)  dX(a) 


when  A  is  nonatomic.  Take  S  to  be  the  mapping  a  >-)•  {q{oi)  \  q  £  JA,q(a )  £  Qa },  which  obviously  is 
closed  and  convex  valued  by  the  properties  of  Qa.  Moreover,  since  both  [0, 1)  and  C2  are  separable, 
there  exists  a  countable  collection  q 1  £  M. such  that  5(a)  =  cl{g*(a)  |  i  =  1,2,...}  for  A-a.e. 

a  £  [0, 1).  Thus,  S  is  B[o, ^-measurable;  see  for  example  [17,  Theorem  1],  The  mapping  S  is  integrably 
bounded  if  there  exists  a  /^immeasurable  g  :  [0, 1)  — >  1R  with  f  g(a )  dX(a )  <  oo  and 

sup  || Q || 2  <  g{a)  for  A-a.e.  a  £  [0, 1). 

QeS(a) 

Since  for  our  choice  of  S  we  have  that  every  Q  £  S(a)  has  Q(u)  <  1/(1  —  a)  for  a.e.  u  £  ST,  integrably 
boundedness  holds  with  g(a)  =  1/(1  —  a)  under  the  imposed  restriction  on  A.  □ 

Next,  we  turn  to  specific  expressions  for  risk  identifiers.  Recall  from  (7)  that  for  any  X  £  C2  and 
positively  homogeneous  regular  measure  of  risk  ou  C2,  a  Q  in  the  risk  envelope  of  the  risk  measure 
that  maximizes  E[XQ\  is  called  a  risk  identifier  at  X.  We  again  start  with  the  building  blocks  from 
first-order  superquantile  risk  measures. 

For  X  £  C? ,  the  set 

Qa  :=  argrna xE[XQ] 

QeQa 

is  convex  and  nonempty  with  its  elements  referred  to  as  risk  identifiers  of  lZa.  Before  we  characterize 
these  risk  identifiers,  we  introduce  additional  notation. 

For  (3  £  (0, 1),  let 

(X)  :=  {u£  Q  |  X(u)  =  qx(P)} 

and  let 

Ffi  (x)  :=  lim  Fx(x'),  x  £  1R 

x'  X  X 
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be  the  left-continuous  “companion”  of  the  cumulative  distribution  function  Fx,  where  the  limit  exists 
by  the  virtue  of  F\  being  nondecreasing  and  bounded  from  above.  For  F\  continuous,  Fx  =  F x  of 
course. 

The  risk  identifiers  of  lZa  are  then  characterized  as  follows;  see  also  [30,  Equation  4.21]  for  closely 
related  expressions. 


3.4  Proposition  For  X  £  C2  and  p  G  (0, 1),  let  r?  G  C2  be  such  that 


0  <  r*  (ui)  <  - -  for  a.e.  u  £  Cl  and 

1  p 


IClp{X) 


r$(u)dP(u)  = 


Fx{qx(P))  -  P 

1-/3 


Every  such  r? ,  defines  a  unique11  Q «  13  G  C2  given  for  a.e.  c v  £  Cl  by 


(  i 
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if  X(u)  >  qx{P) 

if  X(uj)  =  qx(P)  a-iid  P({cu})  >  0 
otherwise. 


(12) 


(13) 


Then, 


Moreover, 


Qq  =  G  £2  Q  =  Qp’  0  for  some  r *  €  C?  satisfying  (12)  |  . 


Qq  =  {Q  £  C2  |  Q(u )  =  1  for  a.e.  u  €  fl}. 


Proof.  Let  /3  £  (0, 1)  and  X  £  C2 .  We  first  show  that  there  exists  an  G  C2  satisfying  (12).  For 
wGSl  satisfying  X(u >)  =  qx(P)  and  P({w})  >  0,  Fx(X(co))  <  P  <  Fx{X(u)),  with  at  least  one  of  the 
inequalities  being  strict,  and 


Fx{X{u))  -  /3 

(1  -P)(Fx(X(u,))-Fx(X(u))) 


G  [0, 1/(1  —  P)\- 


Let  f-p  G  C2  be  defined  for  a.e.  wG  11  by 


fX{u)  :=  { (i-w^wi»-^(.VM))’  if  x(w>  =  alld  P(M>  >  ° 

0  otherwise. 


(14) 


Clearly,  fp  satisfies  0  <  f?(u>)  <  1/(1  —  /3)  for  a.e.  wGfl.  Moreover, 

Fx{qx{P))  -  P 


%(X)  (1  -  P)(Fx{qx(P))  ~  Fx(qx{P))) 


dP(w)  = 


Fx{qx(P))  -  P 
l -P 


/  t$(«>)dF(u)  = 

JtopiX) 

and  f'p  therefore  satisfies  (12). 

11  With  C2  consisting  of  equivalence  classes  of  functions  identical  up  to  on  a  set  of  IP-measure  zero,  uniqueness  of  course 
is  in  the  sense  of  such  equivalence  classes. 
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X  rx 

Let  rp  £  C?  satisfy  (12).  Since  0  <  Qp  p  (u ;)  <  1/(1  —  /3)  for  a.e.  u  £  Q  and 

1 


Q*,r*  (u)dP(u>)  = 


I  1  + 

|  x(uj)>qxm  1  “  P 

1  -  Fx(qx{P))  ,  Fx{qx (/?))-  P 


rp(io)dP(u) 


1-/3 


+ 


1-/3 


=  1, 


we  find  that  p  £  Qp-  Moreover, 

x  rx" 


£ 


/? 


'hefl  |  xM>,x(ft}  1  -Z3 

1  I- 


dP(u;)  + 


X(uS)rjp  (lu)cflP(cj) 


Iftp(X) 


1  “  P  J{we n  |  A»>gx(/3)} 

/oo 

X  dF^(x), 

-OO 


X(w)(flP(u;)  +  gx(/3) 


^.v(gx(/3))  -  /3 
1-/3 


where 


*£(*)  :  = 


13  if  Tx(x)  >  /3 

v0  if  Fx(x)  <  j3. 

It  is  well  known  (see  [24])  that  the  superquantile  qxifi)  =  fXx  dF^(x).  Thus,  we  have  proved  that 

X rx  X rx 

Qp  13  maximizes  E[X-\  over  Qp.  Any  Q  £  Qp  not  equal  to  Qp'  13  for  any  r *  must  necessarily  have 

E[XQ\  <  <j  \  (  3). 

The  case  of  /3  =  0  follows  also  as  then  Qo  =  {Q  £  £2  |  0  <  Q(uj)  <  1  for  a.e.  a j  £  ft,  E[Q]  =  1}.  □ 

A  particular  element  of  Qp  plays  a  central  role  in  the  following.  Let  fp  £  L '?  be  as  defined  in  (14). 
Consequently  by  Proposition  3.4,  Q p  defined  for  a.e.  w  £  by 


T=p  if  X(u)  >  qx{P) 

<  f-p(u)  if  X(uj)  =  qx(fd)  and  P({cn})  >  0 
k  0  otherwise 


(15) 


is  a  point  in  Qp  .  Moreover,  let  Qif  £  C2  be  defined  by  Qq(uj)  =  1  for  a.e.  u  £  Q,  which  therefore  by 
Proposition  3.4  is  a  point  in  Qq  .  The  random  variable  Q’p  behaves  continuously  in  /3  in  a  sense  given 
next. 


3.5  Proposition  If  (3V ,/3  €  [0, 1)  and  (3U  — >  /3,  then  for  any  X  £  C2 ,  \\Qpv  —  Q* || 2  — >  0. 

Proof.  Let  X  £  C2  and  f-jp  be  defined  in  (14)  and  /3  £  (0, 1).  Suppose  that  Fx(qx(/3))—F^(qx(/d))  >  0. 
We  consider  two  cases. 

First,  suppose  that  f3v  — >  /3,  with  f3v  <  (3  for  all  v.  which  implies  that  /3  £  [F^ ( qx(P )),  Fx(qx(P))\-  If 
/3  £  (F%  {qx {($)),  Fx{qx(f3))\,  then  qx{Pv)  =  qx{/3)  for  sufficiently  large  u.  Consequently,  for  sufficiently 
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large  v, 


\\Q^~Q^\\2  = 


X \\2 


+ 


(0  -  0)2dF(u) 
(r£„(uj)  -r£(u))2dF(uj)  + 


!  {u\X(uj)<qx{P)} 

fX  (,  ,\  i.X(,,\\2 

'Qp{X) 

When  X(u)  =  qx(/3u)  =  qX{P), 

f.x  r  ~x ( ,  ,\ Fx(qx((3))  ~  P 


f/3u(u)-fp  (u)  = 


>| X(u)>qx(P)}  V  1  -  PV  1  -  P 

FX{qX{P))  ~  P 


dP(u). 


(1  -  F)(Fx(qx(J3))  -  Fx(qX(m  (1  ~  P)(FX(qX(P))  -  Fx(qx(P))) 


Hence,  all  three  terms  in  the  above  integral  vanish  as  v  — >  oo.  If  (3  =  Fx (qx(fj)),  then  we  only  have 
that  qX{Pu)  FqX(P)  by  the  left-continuity  of  qx  and  in  fact  qX{Pv)  <  qx((3))  for  all  v.  Consequently, 


\\Q^-QfM  = 

+ 

+ 

+ 


,{u)\X(uj)<qxU31')} 


(0  -  0)2dP(w) 


(rpv(uj)  —  0)2dP(w) 


I  {u)\qx(P1')=X(uj)<qx(P)} 


'{u\qx  (PV)<X  (. U))=qx  (/?)} 


1  - 


— dF(u) 


w). 


l{u\qx(01')<qx{l3)<X{uj)}  \1  PV  1  P, 

Of  the  four  integrals,  the  first  and  fourth  ones  obviously  tend  to  zero.  For  the  second  one,  we  see  that 

F({u\qx(n  <  X(u)  =  qxm )  =  Fx{qx{^))  -  Fx{qx{^))  <  Fx(qx((3 ))  -  F^(qx(pv))  ->  0 

by  the  left-continuity  of  Fx  and  consequently  the  integral  also  tends  to  zero.  For  the  third  integral,  we 
find  that  when  X(uj)  =  qX{P) 


M  = 


FX{qX(P))  -  P 


FX(qX(P))  -  Fx(qx(/3)) 


1 


(1  -  P){Fx(qxm  -  Fx(qx(/3)))  (1  -  P)(Fx(qx(J3))  -  Fj(qx(J3)))  1  “  P 

Consequently,  the  third  integral  also  tends  to  zero. 

Second,  suppose  that  — v  (3,  with  (3U  >  j3  for  all  v.  If  f3  €  [Fx(qx(P)),  Fx{qX(P))),  then 

qX(Pu)  =  qX(P)  for  sufficiently  large  v  and  the  corresponding  argument  for  the  first  case  still  holds.  If 
/3  =  Fx(qx(/3)),  then  we  only  have  that  qx{Pv)  >  qX{P))  for  all  u.  Consequently, 


I \Qfr-Q$\\i  = 

+ 

+ 

+ 


(0  -  0)2dP(w) 


'{u\X(u>)<qx(P)} 


l{u)\qx(P)=X(u)<qx(01')} 


(0  -ff  (w))2dP(w) 


l{u\qx(P)<X(uj)=qx(P1')} 


dP(a,) 


{u\qx(P)<qx(P,')<X(u)}  PU  -*■  P 


u). 
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The  first  and  fourth  integrals  obviously  tend  to  zero.  For  the  second  one, 


_ Fx(qx(/3))  ~  P _ =  Fx(qx(/3))  ~  FxjqxjP)) 

(1  -  P)(Fx(qXm  -  Fx(qx{m  ”  (1  -  (3)(Fx(qx(/3))  -  F^qx^))) 


and  consequently  a  zero  integral.  For  the  third  integral, 


rX 

r a* 


(w) 


FX(qXm)  ~  F 

(1  -(J")(Fx(qx(l3"))-F-(qx(l3"))) 


1 


if  qX(fiv)  remains  bounded  away  from  qxifi)  because  then  Fx(j3v)  — >  Fx((3)  =  /3.  If  qx{Pv)  — >•  qx(/3), 
then  by  the  right-continuity  of  Fx  we  have  that 


P({cu  €  n\  qx(P )  <  X[u)  =  qx(n})  =  Fx(qx(n)  -  Fx(qx(n)  <  Fx(qx(n)  -  Fx(qXm  ->•  0. 


Consequently,  the  third  integral  also  tends  to  zero. 

The  situation  with  Fx(qX(/3))  —  Fx(qx(f3))  =  0  follows  with  similar  and  in  fact  simplified  arguments 
as  in  that  case  Fx  is  continuous  at  qxifi)  and  qx  is  continuous  at  ft. 

Finally,  we  consider  the  case  with  ,6  =  0  and  j3u\0.  Then, 


\\Qpv  ~  Qolll  = 
+ 


<{u\ X{u)>qx{Pv)}  V1 

(rpv(uj)  -  l)2d¥(uj)  +  f 

)  Ju 


- 1  dP(w) 


>^u{X) 


{ui\X{u)<qx{Pv)} 


(0  -  l)2dP(o;). 


Since  1/(1  —  f3v)  — >•  1,  the  first  integral  vanishes.  The  last  two  integrals  vanish  since  their  integrands 
are  bounded  and  Fx(qx(Pv))^  0-  □ 

We  are  then  in  a  position  to  characterize  risk  identifiers  of  mixed  superquantile  risk  measures.  For 
X  €  £2,  let 


Qx  :=cUQeC 


Q  =  J  q(/3)  d\(/3),  q  €  M,  q(f3)  €  Qx  for  A-a.e.  ft  €  [0, 1) 


(16) 


3.6  Theorem  (risk  identifiers  for  mixed  superquantiles).  For  X  €  C2,  the  set  Qx  is  convex  and 
satisfies  the  following. 


(i)  If  Q  €  Qx ,  then  Q  is  a  risk  identifier  of  IZ  at  X. 

(ii)  Iff  1/vT  —  (3  d\(/3)  <  oo,  then  Qx  is  nonempty  and  weakly  compact,  and  Q  G  Qx  whenever  Q 
is  a  risk  identifier  of  TZ  at  X.  Moreover,  Q  :=  f  q(/3 )  d\(fd),  where 

q  :  [0, 1)  — >  C2 ,  with  q(ff)  =  Qx  (defined  in  (15))  for  all  (3  €  [0, 1), 

is  furnishing  an  element  of  Qx . 


14 


Proof.  We  first  consider  (i).  Let  Q  G  Qx.  There  exists  sequences  {Qv}^=  1  C  C 2  and  {qu}r^=i  C  M 
such  that  || Qv  —  Q H2  ->  0,  Qv  =  f  qu(/d)  dX(fd),  and  G  for  all  v  and  A-a.e.  /3  €  [0, 1).  Then, 
for  every  z/, 


nx)  =  J  E[x<rm  dm  =  e  x  J  dm 


=  E[XQl 


where  the  middle  equality  follows  by  the  same  argument  as  in  the  proof  of  Theorem  3.1.  Since  by  the 
Cauchy- Schwartz  inequality  E[XQV ]  — >•  E[XQ\.  we  also  have  that  1Z(X)  =  E[XQ\ ,  which  establishes 

W- 

Next,  we  consider  (ii).  Suppose  that  f  l/\Jl  —  (d  dX(fd)  <  00.  We  proceed  toward  a  contradiction. 
Suppose  that  Q  €  Q  is  a  risk  identifier  of  1Z  at  X,  but  Q  0  Qx .  Then  there  must  exists  a  q  e  M  and 
B  €  £[<,,!)  such  that  q(/3)  €  Qg  for  A-a.e.  f3  €  [0, 1),  A (B)  >  0,  and  q(/3)  0  Qx  for  all  f3  €  B.  However, 
this  implies  that  E[Xq(/3)\  <  E[XQx]  for  all  (3  G  B  and  any  Qx  G  Qx .  Consequently,  E[XQ\  <  1Z(X), 
which  is  a  contradiction. 

Since  Q  is  weakly  compact  by  Theorem  3.1,  the  weak  compactness  of  Qx  follows  from  it  being 
a  closed  convex  subset  of  Q.  Finally,  we  show  that  Q  G  Qx.  The  conclusion  follows  when  we  have 
shown  that  q  G  AT  By  Proposition  3.5,  q  is  continuous  and  therefore  (£r0>i),  £>£2)-measurable.  Since 
for  (5  G  (0, 1) 


IIQf  III  = 


W  |  X (L0)>qxm  (!  -P)' 


rdP(o;)  + 


Iftp(X) 


Fx(qx(/3))  -  Id 


1-/3 


+ 


Ex{qx(P))  -  P 


(1  -  jd)2  L(1  -  (d)(FX(qXm  ~  F-(qx(ld)))  J 


(l-P)(Fx(qxm-Fx(qx(m\ 
(■ Fx(qx(ld))-Fx(qxm ) 


dP(w) 


+ 


(FX(qXm  -  fdf 


< 


< 


1-Id  (1  -  mFX(qXm  -  Fx(qx(/d))) 
(1  -/3)(Fx(qxm-ld) 

1-Id  ■  (1  -mFX(qXm-Fx(qX(m 


+ 


+ 


Fx(qx(/d ))  -  Fx(qx(/d )) 


1-/3  (1  -  p)(Fx(qx(P))  -  F-(qx(/d)))  1  -  /3 


and  || Oo  III  =  d,  we  find  that 


2  dm  <  \/2 


v'H 


d\{(d)  <  00. 


Consequently  q  £  M  and  Q  =  f  q(fd)  d\(fd)  G  Qx,  which  complete  the  proof.  □ 

We  observe  that  when  f  l/\/l  —  /d  d\((d)  =  00,  there  are  random  variables  X  G  C2  with  F(X)  =  00. 
In  this  case  it  might  not  be  necessary  to  select  q  in  (16)  with  q(/3)  G  Qx  for  A-a.e.  /3  G  [0, 1)  because 
f  E[Xq(/3)\  d\((d)  might  still  be  infinity.  For  the  special  case  of  a  second-order  superquantile  risk 
measure,  we  directly  obtain  the  following  corollary  without  this  complication. 


3.7  Corollary  For  a  G  [0, 1)  and  X  G  C2 ,  the  set 


Qa  :=  cl  {Q  G  £ 


Q  = 


q(fd)d(d,  q  G  Xi,  q(/d)  G  QX  for  m- a.e.  /3  G  [a,  1) 


1  —  a 
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is  nonempty,  convex,  and  weakly  compact.  Moreover, 

Q  G  Q&  if  and  only  if  Q  is  a  risk  identifier  of  lZa  at  X. 


Further  simplifications  are  possible  in  the  case  of  second-order  superquantile  risk  measures.  As 
usual,  we  interpret  0  times  — oo  as  zero  in  the  following. 

3.8  Theorem  (further  characterization  of  second-order  superquantile  risk  identifiers).  For  X  G  C? 
and  a  G  [0, 1),  Q *  is  the  closure  of  elements  Q „  G  Qa  given,  for  a.e.  w  G  fl,  by 


Qa  M  =  s 


1 

1—a 


log  1=7^)  +  rf(u)dp]  if  a  <  /(w)  <  1 

T=F  IaH  rf  (w)d0  ^  f(w)<a<  F(u) 

0  otherwise , 


where  r*  G  L '?  satisfies  (12)  and  F(u ;)  :=  Fx(X(u))  and  f(oj)  :=  Fx(X(yj)). 

The  specific  choice  f p  G  Cl2  given  in  (14)  results  in  the  risk  identifier  G  Q. £  having,  for  a.e. 
js  G  if. 


f  WF  lQg  T= 


&)={ 


1—a 


1 

1—a 

1 

1—a 

o 


F(w) 

1—a 


lo§WR 

F(uj)—a 


i  + 


l-F(w) 

FM-fH 


+ 


l-F(a)) 


F(u)-f(w)  T  F(«)-/(«) 


i  l— i'l 


1-, 

\uj) 

f(“) 

a 

if  a  <  /(a;)  =  F(w)  <  1 

if  a  <  /(w)  <  F(w) 

if  /(w)  <  a  <  F(ca)  and  /(ca)  <  F(u) 

otherwise. 


Proof.  For  oj  G  Ft  such  that  a  <  Fx(X(yj))  <  1 

r  i 


J {/>£(<*, 1)  |  X(u)>qxm  1  “  I3 

By  Proposition  3.4, 

1 


df3  =  [—  log(l  —  f})] 


FXXM)  =  log 


1  —  a 


1  -Fx(X(u>)) 


(17) 


QaH  = 


1  —  a 
1 

1  —  a 


dp  + 


rjf(u>)dp 


/{/3G(a,  1)  |  X(u >)>qXm  1  -  Z3  1)  |  A-(«)=<zxG8)} 


log 


1  —  a 


rFx(X(u ,)) 


+ 


r$(oj)dp 


1  “  FX  ( X (w))  JF~  (X(w)) 


which  proves  the  first  claim.  The  second  claim  follows  by  a  similar  argument. 

We  next  turn  to  the  specific  choice  of  fp.  For  a  <  Fx  (X(u))  =  Fx(X(uj))  <  1,  the  conclusion 
follows  trivially.  For  a  <  Fx  (X(uj))  <  Fx(X(uj)),  integration  gives  that 


rFx(X{w)) 

lF~(X(u)) 


rf(u>)dp  = 


rFx(X(u)) 


Fx(X(u))  -  P 


'f-(x^))  (1  -  P)(F x(X(u))  -  Fx(X(u))) 


dp 


=  1  + 


l-Fx(X(u)) 


log 


l-Fx(X(u)) 


Fx(X(u))-Fx(X(u))  "1  -Fx(X(u)Y 
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and  the  corresponding  conclusion  follows.  The  last  case  follows  by  a  similar  calculation.  □ 

The  situation  is  especially  simple  for  the  following  case. 

3.9  Corollary  Suppose  that  Fx  is  continuous  for  X  €  C2  and  a  €  [0, 1).  Then,  Q x  is  a  singleton12 
with  element  given,  for  a.e.  co  €  P,  by 


Q 


=  Jl=5loSr TFOTU))  ifa<Fx(X(u))<  1 


otherwise. 


It  is  obvious  that  expressions  of  risk  identifiers  provide  alternative  expressions  for  risk  measures. 
Specifically,  for  X  €  C2, 

1Z{X)  =  sup  [  X  (uj)Q(uj)dP(u)  =  [  X (u)Qx (u)dP(uj) , 

QeQJ  J 

for  any  Qx  E  Qx .  In  the  case  of  the  previous  corollary,  it  is  easy  to  see  that  the  second-order 
superquantile  risk  takes  the  simple  form 


na{x)  = 


i 


f,°°  1  -  a 

xlog- - m  r^dFx(x), 


1  “  a  Fix  (a)  l-Fx(x) 

where  qxip)  =  —  oo  for  a  =  0,  which  complements  the  expression  of  Theorem  2.4. 


4  Applications  to  Optimization  and  Regression 

In  applications  arising  in  optimization  under  uncertainty  and  risk-averse  regression,  one  is  not  only 
interested  in  the  risk  of  a  single  random  variable  X,  but  rather  of  a  parameterized  family  of  random 
variables  over  which  the  “best”  is  to  be  selected  according  to  some  criterion  and  constraints.  When 
the  criterion  and/or  the  constraints  are  given  in  terms  of  measures  of  risk  applied  to  this  family  of 
random  variables,  we  obtain  optimization  problems  involving  parameterized  risk.  Properties  of  these 
measures  of  risk  as  functions  of  the  parameters  as  well  as  formulae  for  the  functions’  (sub)gradients 
become  central.  In  this  section,  we  discuss  optimization  problems  involving  parameterized  mixed  and 
second-order  superquantile  risk.  In  particular,  we  develop  expressions  for  subgradients  relying  on  the 
risk  identifiers  of  Section  3. 

We  consider  a  family  of  random  variables  Xu  =  g(u,-),  u  €  J?” ,  generated  by  the  function  g  : 
Mn  x  Q  M.  Consistent  with  the  previous  sections,  we  assume  that  Xu  €  C2  for  all  u  E  Rn .  For  a 
weighting  measure  A  and  the  corresponding  mixed  superquantile  risk  measure  1Z,  as  before  given  by 

U{Xu)  =  J  qXu{P)  dm, 

we  get  a  function 

f(u):=K(Xu),  u€Mn,  (18) 

12  Again,  uniqueness  is  up  to  on  a  set  of  P-measure  zero. 
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representing  parameterized  risk.  One  might  then  proceed  with  determining  a  u  €  Mn  that 

minimizes  f(u)  over  a  subset  of  ]Rn 

or,  alternatively,  with  determining  a  u  €  Mn  that 

minimizes  some  criterion  function  of  u  subject  to  f{u )  <  0  and  possibly  other  constraints. 

Algorithms  such  as  cutting  plane  and  bundle  methods  for  solving  these  optimization  problems  require 
expressions  for  (sub)  gradients  of  /.  Justification  for  these  approaches  is  provided  by  the  Convexity 
Theorem  of  [25],  which  establishes  that  /  is  convex  whenever  g(-,u ;)  is  convex  for  a.e.  oj  €  12. 

In  the  remainder  of  the  paper,  we  derive  expressions  for  subgradients  of  /,  but  refrain  from  discussing 
full  algorithms;  see  for  example  [15,  12,  29]  for  risk  minimization  algorithms  based  on  dual  approaches 
and  [30]  for  related  subgradient  expressions.  However,  we  end  the  paper  with  a  discussion  of  primal 
and  dual  methods  in  the  context  of  superquantile  regression. 


4.1  Subgradients  of  Parameterized  Risk 

We  restrict  the  attention  to  the  case  with  f  \/\J\  —  a  d\(a)  <  oo  which  ensures  the  finiteness  of  1Z  on 
L '?  and  also  the  weak  compactness  of  Q.  We  equip  JRn  x  C2  with  the  product  topology  generated  by 
the  norm  topology  on  lRn  and  the  weak  topology  on  C2 .  The  convergence  of  points  in  lRn  x  C2  in  this 
weak  sense  is  denoted  by  — . 

For  notational  convenience,  we  let  h  :  lRn  x  C2  — >•  M  be  given  by 

h(u,Q)  :=  J  g(u,ui)Q(uj)dF(uj).  (19) 

Properties  of  this  function  are  established  next. 


4.1  Proposition  Consider  h  in  (19)  and  suppose  for  an  open  set  U  C  Mn  that 

(i)  there  exists  an  L  £  C2  such  that 

\g(u,u)  —  g(v! ,oj) |  <  L(u>)\\u  —  •u/||  for  all  u,u'  €  U  and  a.e.  w  €  H 

(ii)  for  every  i  =  1  there  exists  an  C  P,  with  P{Pi}  =  1.  and  an  Li  €  C2  such  that 

dg(u,u>)/dui  exists  for  u  €  U  and  co  €  Pj,  and 


dg{u,u)  dg(u',uj) 


dui 


dui 


<  Li(u)\\u  —  u  ||  for  all  u,u'  €  U  and  lo  €  P* 


(in)  g(v ,  • ),dg(vl ,  • )/dui  €  C2  for  some  v,vl  G  U,  i  =  1, ...,  n. 

Then,  li  is  weakly  continuous  on  U  x  C2  and  X7uh  exists  and  is  likewise  weakly  continuous  on  U  x  C2 . 
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Proof.  First  we  consider  h,  which  is  well-defined  and  finite  on  U  x  C2  from  assumptions  (i)  and 
(iii).  Suppose  that  (uv,Qv)  w  ( u,Q ),  with  uu  ,u  G  U  and  Qv  ,Q  G  C2 .  Then  by  the  triangle  and 
Cauchy- Schwartz  inequalities  and  assumption  (i), 

\h(uu,  Qv)  —  h(u,  Q)\  <  J[g(uu,u)  -  g(u,u)]Qu(uj)clF(uj)  +  J  g(u,u})[Qv(uj)  —  Q(oj)]dF(uj) 

<  II g(uv,  •)  -  g(u,  OlhllQ^lb  +  J g(u,u)[Qu(uj)  -  Q(u)]dF(u) 
<(E[L2])1/2\\ut'-u\\\\Qv\\2  +  f  g(u,u)[Cnu)-Q(u,)\d P(w) 

By  the  Uniform  Boundedness  Principle,  {HQ^Ihl^i  is  bounded  and  the  first  term  therefore  vanishes. 
Since  assumptions  (i)  and  (iii)  imply  that  g(u,ui)  G  C?  for  all  u  G  U,  the  second  term  vanishes  by  the 
weak  convergence  of  Qu  to  Q. 

Second  we  consider  Vu/i.  Following  a  standard  argument  and  the  Dominated  Convergence  Theorem 
(see  for  example  the  proof  of  Theorem  7.44  in  [32]),  we  find  that  for  every  u  G  U  and  Q  G  C2,  S7uh(u,  Q ) 
exists  and  is  given  by 

Vuh(u,Q )  =  J  Vug (u,u)Q(u)dP(uj). 

Repeating  the  above  argument  with  g  replaced  by  dg/diii  and  assumption  (i)  by  assumption  (ii)  estab¬ 
lishes  the  claim  about  Vu/i.  □ 

In  view  of  Proposition  4.1,  the  following  conclusions  is  a  direct  consequence  of  [28,  Theorem  10.31]. 

4.2  Theorem  (subdifferentiability  of /).  Suppose  that  the  assumptions  of  Proposition  4.1  holds.  Then, 
f  in  (18)  is  locally  Lipschitz  continuous  on  U  and  strictly  differentiable 13  where  it  is  differentiable.  There 
exists  a  set  D  C  U  such  that  U  \  D  is  negligible 1  ,  /  is  differentiable  on  D,  and  the  gradient  V/  is 
continuous  relative  to  the  set  D. 

Moreover,  the  directional  derivative  of  f  at  u  G  U  in  direction  v  G  Mn  is 
df(u )  (v)  =  max  j  (E  [ Vug(u ,  -)Q\,v)  Q  G  Qaiu j 
and  the  subdifferential  of  f  at  u  G  U  is 

df(u)  =  con  { 1 E  ( Vug{u , -)Q]  |  Q  G  QaM  }  , 
where  Qa(u’’l  is  given  in  (16)  with  X  replaced  by  g(u,  •). 

□ 

We  observe  that  when  A  =  Aa,  i.e. ,  the  focus  is  on  a  second-order  superquantile  risk  measure  lZa, 
then  Qs(u’-)  is  fully  characterized  by  Theorem  3.8.  In  particular,  the  latter  half  of  that  theorem  provides 
a  specific  risk  identifier  Q  G  Q9^u^  that  is  easily  calculated  when  fl  has  finite  cardinality.  Such  a  risk 
identifier  then  provides  the  subgradient  E[Xug(u,  ■ )Q ]  of  /,  which  also  is  easily  calculated  in  this  case. 

13Recall  that  /  :  Rn  — >  R  is  strictly  differentiable  at  a  point  x  if  f{x)  is  finite  and  there  is  a  vector  v  £  Rn  such  that 
(f(x')  —  f(x)  —  ( v,x '  —  x))/\x'  —  x\  — »  0  whenever  x,x'  —>  x  and  x'  x;  see  [28,  Definition  9.17]. 

14A  subset  of  a  set  of  Lebesgue  measure  zero  is  negligible. 
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4.2  Application  to  Superquantile  Regression 

Superquantile  regression  as  laid  out  in  [22]  (see  also  [20]  and  [13])  resembles  quantile  regression,  but 
instead  of  estimating  conditional  quantiles,  it  focuses  on  conditional  superquantiles.  Specifically,  we 
find  that  for  Y  £  L '?  and  a  €  (0, 1), 

{qv(a)}  =  argmiri £a(Y  —  uq),  where  £a(Y)  :=  Va(Y)  —  E[Y ] 

uo£R 

is  a  measure  of  error  given  in  terms  of  the  measure  of  regret 15  VQ  defined  in  (5).  In  the  same  manner  as 
minimizing  mean-squared  error  yields  an  expectation  and  the  foundation  for  least-squares  regression, 
and  minimizing  a  Koenker-Basset  error  yields  a  quantile  and  the  foundation  for  quantile  regression, 
minimizing  £a  leads  to  superquantile  regression. 

Superquantile  regression  deals  with  the  problem  of  approximating  a  random  variable  Y  €  C2  by  a 
combination  of  more  accessible  random  variables  X±,  X%, ...,  Xn  £  C2,  such  that  the  error  as  quantified 
by  £a  is  minimized.  Hopefully,  the  knowledge  of  X  =  (Ai,...,Xn)  would  then  provide  reasonably 
accurate  predictions  of  Y .  Limiting  the  scope  to  affine  regression  functions,  superquantile  regression 
then  needs  to  solve  the  problem 


min  £<*  (Y  -  [w0  +  («,  A)]) 

uo£R;u£R 

to  obtain  regression  coefficients  uq  and  u.  That  is,  the  regression  coefficients  ( uo,u )  are  selected  such 
that  the  error  between  Y  and  the  model  uq  +  (u,  X)  is  minimized. 

We  show  in  [22]  that  this  problem  can  be  decomposed  into  the  two  problems 

1  r1 

(i)  find  u  £  argrnin  — —  /  qg(u,.)(f3)d/3  -  E[g(u ,  •)]  and  (ii)  find  u0  =  qg^.){a), 

u£Rn  1  ®  J  a 

where  for  each  u  £  Mn, 

g(u,-)  =  Y  -  (u,  X) 

is  a  random  variable  defined  on  the  sample  space  H  =  J?,n+1 ,  with  sigma-algebra  BRn+ 1,  and  probability 
P  given  by  the  distribution  of  (A,  Y).  The  problem  (i)  is  that  of  minimizing  a  second-order  superquantile 
of  g(u,  •)  minus  the  expectation  of  g(u,  •).  Since  E\g(u,  •)]  =  E[Y]  —  ( u ,  E[X])  is  a  deterministic  quantity, 
this  problem  is  essentially  in  the  form  discussed  earlier  in  the  section:  to  minimize  a  mixed  superquantile 
risk  measure,  in  fact  a  second-order  superquantile  risk  measure. 

Suppose  that  the  distribution  P  is  supported  on  the  points  {(xJ,  y-?')}^=1  C  ]Rn+l  with  P{( = 
p7,  j  =  1,  ...,zq  as  is  the  case  in  practice  when  the  regression  relies  on  the  observed  data  {(x7,  yJ')}^=1. 
Then,  the  evaluation  at  a  given  u  £  Mn  of  the  objective  function 

/(«)  =  [  Qg(u,- )(P)dP  -  E\g(u ,  •)] 

of  problem  (i)  and  a  corresponding  subgradient  are  achieved  as  follows:  Determine  the  cumulative 
distribution  function  of  g(u,  •)  and  use  the  formula  in  the  second  half  of  Theorem  3.8,  with  X  replaced 

15We  refer  to  [25]  for  a  general  treatment  of  measures  of  error  and  regret. 
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by  g(u,  •),  to  determine  a  risk  identifier  Qa"'  \  This  computation  can  be  obtained  in  0{y\ogv)  time, 
with  sorting  of  {y9  —  (u,  xJ)}t'=1  to  obtain  the  cumulative  distribution  function  being  the  bottleneck. 
Then,  in  view  of  Theorem  4.2,  the  function  value  and  a  subgradient  are  readily  available  through 

V  V 

f  0)  =  ~  -  (u,  X3)) 

3= 1  J=1 


and 

t'  Z^ 

V/(it)  =  where  =  (x9  ,y9). 

3= 1  j=l 

We  note  that  the  assumptions  of  Proposition  4.1  are  easily  verified  in  this  case  due,  in  part,  to  the 
affine  form  of  g(-,  cu).  Consequently,  each  iteration  of  a  cutting-plane  method  or  bundle  method  requires 
therefore  computational  time  of  order  0{y  log  v)  as  a  function  of  the  number  of  data  points.  The  number 
of  iterations  needed  would  depend  on  the  method,  n  (the  number  of  explanatory  variables),  and  other 
factors.  In  comparison,  a  “primal”  method  proposed  in  [22]  for  solving  the  same  problem  requires  the 
solution  of  a  linear  program  with  n  +  0(u2)  variables  and  0(z/2)  inequality  constraints.  It  is  therefore 
clear  that  for  small  n  and  large  u.  which  is  typical  in  regression  problems,  a  dual  method  relying  on 
the  expressions  derived  in  this  paper  might  outperform  the  linear-programming-based  approach;  see 
[13]  for  empirical  evidence  supporting  this  claim.  In  fact,  even  storage  of  the  linear  program  becomes 
challenging  for  large  u. 


A  Appendix 

As  support  for  proving  Theorem  2.3  in  Section  2,  we  need  the  following  consequence  of  the  Fubini- 
Tonelli’s  Theorem. 


A.l  Proposition  Suppose  that  (A,  A ,  //)  and  (y,  B,  u)  are  sigma-finite  measure  spaces.  If  f  :  X  xT  — >• 
M  is  measurable  with  respect  to  the  product  sigma-algebra  on  X  x  y  and  g  :  X  x  y  — >•  IR  is  integrable 
with  respect  to  the  product  measure  /i  x  v,  with  f(x,  y)  >  g(x,  y)  for  (p  x  v)-a.e.  (x,  y)  £  X  x  y,  then 
the  following  hold: 


0) 

(») 

(Hi) 


the  function  hi  =  f  f(x,  •)  dp(x)  is  B-measurable, 
the  function  li2  =  /  f{-,y )  dv{y)  is  A-measurable, 
and 


J  f  d(px  u) 


f(x,y)  dp(x) 


dv(y)  =  j 


f(x,y)  dv(y) 


dp(x). 


Proof.  We  recall  that  the  integral  of  the  sum  of  a  nonnegative  measurable  function  and  an  integrable 
function  equates  the  sum  of  the  individual  integrals  under  the  usual  rules  for  handling  addition  with 
infinity.  Then, 


hi  = 


f(x, -)dp(x)  =  J (/  -  g)(x, -)dp(x)  +  J  g(x,  -)dp(3 
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is  immeasurable  since  both  terms  on  the  right-hand  side  are  immeasurable  by  the  Fubini-Tonelli  The¬ 
orem.  A  similar  argument  yields  the  conclusion  for  h-2-  The  final  assertion  follows  by  applying  the 
Fubini-Tonelli  Theorem  to  f  —  g  and  g ,  and  the  above  rule  about  interchange  of  summation  and  inte¬ 
gration.  □ 


Proof  of  Theorem  2.3.  For  every  X  E  C2 ,  qx  is  continuous  and  finite  on  [0, 1)  and  therefore  i3[o.i)- 
measurable.  Moreover,  qx  >  E[X]  and  therefore  TZ(X)  >  E[X\  >  — oo.  Consequently,  TZ  is  well-defined 
with  values  in  [Fi[X],oo].  Its  regularity  and  positive  homogeneity  follow  directly  from  those  of  1ZQ ;  see 
[25].  Since  qx  is  strictly  increasing  on  [0, 1)  for  nonconstant  X ,  we  have  that  if  A({0})  <  1,  then 


H{X)  =  £[X]A({0})  +  [  qx(P)  dX(fi)  >  £[A]A({0})  +  E[X}(  1  -  A({0})  =  E[X] 

J  l>/?>0 


and  the  strict  lower  bound  follows.  From  (6), 

U{X)  <  £  E[X]  +  -^=L=  d\{^)  =  E[X\  +  a(X)  £  -£=  d\{P)  <  oo 

under  the  stated  assumption,  which  establishes  the  corresponding  finiteness  on  C2 .  In  the  case  of 
sup  A  <  oo,  finiteness  of  1Z{X)  follows  trivially. 

We  next  consider  the  alternative  expression.  By  definition, 


nx)  = 


qx(fi)ip(a,fi)dfi 


dX(a ), 


(20) 


with  fi>(ot,fi)  =  if0<a</3<l  and  ip  (a,  (5)  =  0  otherwise.  We  equip  [0,1)  x  (0,1)  with 

the  product  measure  A  x  m  defined  on  the  product  sigma-algebra  B\ 0;i)  <8>£>(o,i)-  It  is  obvious  that  ij)  : 
[0, 1)  x  (0, 1)  ^  Mis  (^[oq)<8>i3(o,i))-measurable  and  likewise  qx,  viewed  as  a  function  on  [0, 1)  x  (0, 1)  that 
is  constant  in  its  first  argument,  due  its  monotonicity.  Consequently,  the  function  (a,  j3)  >-)•  qx(f3)il>(a,  /3) 
is  measurable  in  the  same  sense.  Then,  we  look  toward  the  interchange  of  integration  order  in  (20). 

We  consider  three  cases,  (i)  Suppose  that  X  >  0  a.e.  Then,  qx  >  0  and  qxip  >  0,  and  the 
interchange  of  integration  order  is  permitted  by  Tonelli-Fubini’s  Theorem,  (ii)  Suppose  that  X  <  0 
a.e.  Then,  —qx  >  0  and  —  qx^  >  0,  and  the  interchange  of  integration  order  is  again  permitted  by 
Tonelli-Fubini’s  Theorem,  (iii)  Suppose  that  neither  (i)  nor  (ii)  holds.  Then,  there  exists  a  fix  €  (0, 1) 
such  that  qx(fi)  >  0  for  fi  >  fix  and  qx(fi)  <  0  for  fi  <  fix •  In  view  of  Proposition  A.l,  it  suffices  to 
find  an  integrable,  lower-bounding  function  of  qx'fi’-  Let  g  :  [0, 1)  X  (0,1 )  —>  M  be  given  by 


g(a,fi) 


qx  (fi)/{l-  fix)  if  0  <  a  <  fi  <  fix 
<  qx  (fi)  if  0  <  a  <  fi  <  1,  fix  <  fi 

0  otherwise. 


Clearly,  qxfij  >  g  and 


J  \g\d(X  x  m )  < 


1 

1  —  fix 


qx\d(X  x  m) 


— r 

1  -  fix  Jo 


\qx{fi)\dfi 


dX(a), 


(21) 
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where  the  equality  follows  by  Tonelli-Fubini’s  Theorem.  The  inner  integral  simplifies  to 


\qx(/3)\d/3  =  /  qx{P)dp~ 

Jpx 


rPx 


rPx 


qx{P)dP  =  (1  -  / 3x)qx(Px )  -  /  qx{P)dp. 


The  last  term  requires  further  simplification.  Recall  that  for  a  €  (0, 1), 


1 

a 


qx(/3)d/3 


1 

a 


q~x{/3)d/3 


-q-x(  1  -  «)• 


Applying  this  result,  the  inner  integral  from  above  simplifies  further  to 

i 

\qx{P)\dp  =  (1  —  Px)qx(Px)  +  Pxq-x(  1  —  Px)  <  oo. 

Consequently  in  view  of  (21),  g  is  integrable  and  therefore  furnishes  the  necessary  lower-bounding, 
integrable  function  in  Proposition  A.l,  which  completes  part  (iii).  We  are  therefore  permitted  to 
interchange  the  order  of  integration  in  (20)  and  get 


K(X)  = 


/ o  L./0 


qx(P)^(a,  P)dp 


d\(a)  =  /  qx(P) 


ft)  dX(a) 


dp  =  /  qx(P)(p{P)dp, 


where  the  last  equality  follows  from  the  definition  of  ip. 

The  final  assertions  follow  from  recognizing  that  the  Lebesgue-Stieltjes  measure  dtp  associated  with 
a  function  (p  has  d<p(a)  =  j  '  dX(a)  for  a  weighting  measure  A  on  [0, 1).  □ 

Now  we  articulate  other  definitions  and  technical  results  required  in  the  paper. 


A. 2  Definition  Let  (T,  A,  fi)  he  a  complete  measure  space,  with  p  sigma-finite,  X  a  separable  reflexive 
Banach  space,  and  A4  a  linear  suhspace  of  the  linear  space  of  all  (A,  Bx) -measurable  functions  x  :  T  — » 
X.  The  set  M.  is  (A,  Bx) -decomposable  if,  whenever  x  €  M.  and  xo  :  S  —>  X  is  a  bounded  (A,  Bx)~ 
measurable  function  on  a  set  S  G  A,  with  p(S)  <  oo,  then  the  function  y  :T  — >•  X  given  by 


also  belongs  to  M. . 


y(t) 


xo  (t)  if  t  €  S 
x(t)  if  t  €  T  \  S 


A. 3  Definition  In  the  notation  of  Definition  A. 2,  we  say  that  a  function  f  :  T  x  X  — >•  (—00,00]  is  a 
normal  integrand  if  the  following  hold: 

(i)  f  is  (A  <8>  Bx) -measurable  and 

(ii)  for  every  t  £  T,  f(t,  •)  is  lower  semicontinuous  on  X  and  not  identical  to  00. 


A. 4  Proposition  Suppose  that  the  conditions  and  notation  of  Definition  A. 2  hold  and  /  Txl'^ 
(—00, 00]  is  a  normal  integrand.  Then,  the  following  hold: 
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(i)  the  functions  t  i->-  inf £ex  f(t,0  and  t  i->-  f(t,x(t)),  with  x  :  T  — >•  X  (A,  Bx) -measurable,  are 
A-measurable  and 

(ii)  if  M  is  (A,  Bx) -decomposable  and  there  exists  an  x  £  M.  such  that  f  f(t,x(t))  dp{t)  <  oo,  then 

inf  /  =  [  Vit)  dn(t ),  where  tp(t)  =  inf  /(*,£)■  (22) 

Proof.  First,  we  consider  t  i->-  inf For  measurable  spaces  (X\,A\)  and  (#2^2),  we  recall 
that  a  set-valued  mapping  S  :  X\  ^4  X2  is  (Ai,  A2)-measurable  if  its  graph  is  measurable  in  the  sense 
that 

{(xi,x2)  €  X\  X  X2  |  X2  £  S(xi)}  £  Ai  <8>  A2, 

where  Ai<8>  A2  is  the  product  sigma-algebra  generated  by  A\  and  A2.  Since  /  is  a  normal  integrand,  the 
set-valued  mapping  1 1->-  epi  f(t,  •)  is  A-measurable  and  closed- valued;  see  for  example  [17,  Proposition 
1].  By  [17,  Theorem  1(f)],  there  exists  a  countable  collection  of  A-measurable  functions  gi  : 

T->  X  x  J?  of  the  form  gt(t)  =  (xi(t),  al(f)).j  X{(t)  £  X  and  cq(t)  £  1R,  such  that 

epi/(i,  •)  =  d{gi(t)}i£l  for  all  t  £  T, 

where  cl  denotes  closure.  The  mapping  t  >-)•  ai(t)  is  also  A-measurable.  Consequently, 

inf  fit,  0  =  inf  otAt)  for  all  t  £  T 

iei 

and  the  conclusion  follows  from  the  fact  that  the  pointwise  infimum  of  a  countable  collection  of  mea¬ 
surable  functions  is  a  measurable  function. 

Second,  we  consider  t  e-)-  f(t,x(t)),  which  is  a  composition  of  /  with  the  measurable  mapping 
t  i->-  (t,x(t))  and  therefore  measurable. 

Third,  we  establish  part  (ii)  by  following  the  arguments  in  the  proof  of  Theorem  2  in  [17].  By 
assumption  there  exists  a  function  x\  £  A4  and  a  /r-integrable  function  07  :  T  — >  1R  such  that 

f{t,x\{t))  <  07(f)  for  every  t  £  T. 

Since  <p(t)  <  f(t,x(t))  for  every  function  x  £  M  and  t  £  T  by  definition  and  p  is  A-measurable  by 
part  (i),  the  integral  of  ip  is  well-defined  and  either  finite  or  equals  —00.  Consequently,  the  inequality 
>  holds  in  (22).  Now,  let  7  €  JR  be  such  that 

J  (p{t)  dnit)  <  7.  (23) 

We  will  prove  the  existence  of  a  function  x  £  A4  such  that 

J  f(t,x(t))  dp(t)  <  7,  (24) 

thereby  establishing  part  (ii).  From  (23)  and  the  properties  of  (T,  A, //),  there  exists  a  /j-integrable 
function  07  •  T  M  such  that  <p{t)  <  07 (t)  for  every  t  £  T  and 

J  a0(t )  dp{t)  <  7.  (25) 
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We  define  the  set- valued  mapping  S  :  T  X  by 


S(t)  =  {£  G  A  |  /(*,£)  <  a0(t)}  for  t  G  T. 

Since  the  function  (f,  £)  eG  f(t ,  £)  —  ao(t)  is  (A  <S>  £>,r) -measurable,  S  is  also  M-measurable.  Moreover, 
S(t)  is  for  each  1  £  T  closed  and  nonempty.  Since  S  is  A- measurable,  there  exists  a  A- measurable 
selection  xq.  i.e. ,  a  M-measurable  function  xq  such  that  xq (t)  G  S(t)  for  every  1  G  T;  see  for  example  the 
corollary  of  Theorem  1  in  [17].  Since  (25)  holds,  there  exists  a  measurable  set  To  C  T,  with  /j(Tq)  <  oo, 
such  that 

/  ao(t)  dn(t)  +  /  ai(t)  d/j,(t)  <  7.  (26) 

■jTq  Jt\T0 

By  the  construction  of  S  in  terms  of  ao,  the  measurable  selection  xq  can  be  chosen  to  be  bounded  on 
To.  Let  x  :  T  — >•  X  be  such  that  x(t)  =  xq (t)  for  t  G  To  and  x(t)  =  x\(t)  for  t  G  T\Tq.  Then,  x  G  M.  by 
the  assumption  of  decomposability,  and  we  have  that  f(t,x(t))  <  «o (t)  for  t  G  To  and  f(t,x(t))  <  «i(t) 
for  t  G  T  \  To-  From  (26)  we  then  conclude  (24),  which  establishes  part  (ii).  □ 

A. 5  Lemma  If  q  :  [0, 1)  — >•  C?  is  (>Sr0,i) ,  -measurable,  then 

(i)  the  function  fi  :  [0, 1)  x  fl  — >•  1R  given  by  =  q{j3){ui )  is  (Sr0ii)  <8>  T") -measurable,  and 

(ii)  the  function  f-2  :  [0, 1)  — >  1R  given  by  f 2,(13)  =  ||g(/3)||2  is  Bi0  ^-measurable. 

Proof.  For  part  (i)  simply  observe  that  f\  =50/1,  where  h  :  [0, 1)  X  f l  ^  C2  x  f 2,  with  h(a,u )  = 
(q(a),u),  and  g  :  C2  xf}  ^  M,  with  g(Q,  u>)  =  Q(u>).  The  conclusion  then  follows  from  the  measurability 
of  q  and  elements  of  £2,  and  the  fact  that  composition  of  measurable  functions  is  measurable.  Next, 
we  consider  part  (ii).  A  trivial  extension  of  part  (i)  establishes  that  the  function  (f3,ui)  i-G  [q(/3)(u)]2 
is  (i3[o,i)  ®  -T)-measurable.  Since  it  is  also  nonnegative,  it  follows  from  Tonelli-Fubini’s  Theorem  that 
[. f-2(-)}2  is  Br0  j j-measurable.  □ 

The  following  is  a  direct  consequence  of  Definition  A. 2. 

A. 6  Proposition  The  set  A4  is  (jB[0,i) ,  B^) -decomposable. 
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